Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 11430 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 1 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 1.3 MiB |
| Average record size in memory | 122.0 B |
Variable types
| Text | 2 |
|---|---|
| Categorical | 4 |
| Numeric | 8 |
| Unsupported | 1 |
| Boolean | 2 |
Alerts
| Has_Url_Pattern has constant value "1" | Constant |
|---|---|
| Dataset has 1 (< 0.1%) duplicate rows | Duplicates |
| Entropy is highly overall correlated with Has_File_Path and 1 other field | High correlation |
| Has_File_Path is highly overall correlated with Entropy | High correlation |
| Has_Uncommon_Chars is highly overall correlated with Special_Char_Count | High correlation |
| Hierarchy_Level is highly overall correlated with Slash_Count | High correlation |
| Parameter_Count is highly overall correlated with Entropy | High correlation |
| Slash_Count is highly overall correlated with Hierarchy_Level | High correlation |
| Special_Char_Count is highly overall correlated with Has_Uncommon_Chars | High correlation |
| Has_File_Path is highly imbalanced (68.9%) | Imbalance |
| Entropy is highly imbalanced (55.7%) | Imbalance |
| status_numerico is uniformly distributed | Uniform |
| Dot_Positions is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
| Number_Count has 6566 (57.4%) zeros | Zeros |
| Special_Char_Count has 7510 (65.7%) zeros | Zeros |
Reproduction
| Analysis started | 2024-02-10 00:26:47.433885 |
|---|---|
| Analysis finished | 2024-02-10 00:26:51.234167 |
| Duration | 3.8 seconds |
| Software version | ydata-profiling v4.6.4 |
| Download configuration | config.json |
url
Text
| Distinct | 11429 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 89.4 KiB |
Length
| Max length | 1641 |
|---|---|
| Median length | 439 |
| Mean length | 61.120035 |
| Min length | 12 |
Characters and Unicode
| Total characters | 698602 |
|---|---|
| Distinct characters | 100 |
| Distinct categories | 14 |
| Distinct scripts | 3 |
| Distinct blocks | 3 |
Unique
| Unique | 11428 |
|---|---|
| Unique (%) | > 99.9% |
Sample
| 1st row | http://www.crestonwood.com/router.php |
|---|---|
| 2nd row | http://shadetreetechnology.com/V4/validation/a111aedc8ae390eabcfa130e041a10a4 |
| 3rd row | https://support-appleld.com.secureupdate.duilawyeryork.com/ap/89e6a3b4b063b8d/?cmd=_update&dispatch=89e6a3b4b063b8d1b&locale=_ |
| 4th row | http://rgipt.ac.in |
| 5th row | http://www.iracing.com/tracks/gateway-motorsports-park/ |
| Value | Count | Frequency (%) |
| http://stolizaparketa.ru/wp-content/themes/twentyfifteen/css/read/chinavali/index.php?email=_xxx@yyy.com | 7 | 0.1% |
| http://153284594738391.statictab.com/2506080 | 3 | < 0.1% |
| http://e710z0ear.du.r.appspot.com/c:/users/user/downlo | 2 | < 0.1% |
| http://www.paypal-verification.applmanager.com/customer_center/user-478741 | 2 | < 0.1% |
| http://tokokainbandung.com/wp-content/themes/theretailer/inc/addons/login/customer_center/customer-idpp00c672/myaccount/signin | 2 | < 0.1% |
| https://sites.google.com/site/recoveryfbconfirmcontactus | 2 | < 0.1% |
| https://milenyumpark.com.tr/iletisim | 2 | < 0.1% |
| http://www.courgeon-immobilier.fr/wp-content/uploads/2019/07/tpg/fa2acd5487b1bef895a7453e5dc96013 | 2 | < 0.1% |
| https://www.zabor-vn.com/system/csvprice_pro/smart/customer_center/customer-idpp00c354/myaccount/signin | 2 | < 0.1% |
| https://elhagearms.com/jppp/toda | 2 | < 0.1% |
| Other values (11244) | 11407 | 99.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 49171 | 7.0% |
| / | 49030 | 7.0% |
| e | 40829 | 5.8% |
| o | 35303 | 5.1% |
| a | 32792 | 4.7% |
| p | 30923 | 4.4% |
| s | 29208 | 4.2% |
| c | 28485 | 4.1% |
| . | 28354 | 4.1% |
| i | 27617 | 4.0% |
| Other values (90) | 346890 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 492123 | 70.4% |
| Other Punctuation | 95110 | 13.6% |
| Decimal Number | 62318 | 8.9% |
| Uppercase Letter | 30171 | 4.3% |
| Dash Punctuation | 11402 | 1.6% |
| Connector Punctuation | 3688 | 0.5% |
| Math Symbol | 3552 | 0.5% |
| Control | 65 | < 0.1% |
| Open Punctuation | 64 | < 0.1% |
| Close Punctuation | 63 | < 0.1% |
| Other values (4) | 46 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1951 | 6.5% |
| D | 1674 | 5.5% |
| S | 1508 | 5.0% |
| C | 1502 | 5.0% |
| F | 1499 | 5.0% |
| E | 1399 | 4.6% |
| B | 1368 | 4.5% |
| N | 1287 | 4.3% |
| T | 1274 | 4.2% |
| M | 1266 | 4.2% |
| Other values (18) | 15443 |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 49171 | 10.0% |
| e | 40829 | 8.3% |
| o | 35303 | 7.2% |
| a | 32792 | 6.7% |
| p | 30923 | 6.3% |
| s | 29208 | 5.9% |
| c | 28485 | 5.8% |
| i | 27617 | 5.6% |
| r | 23269 | 4.7% |
| n | 22460 | 4.6% |
| Other values (17) | 172066 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 49030 | |
| . | 28354 | |
| : | 11749 | 12.4% |
| & | 1855 | 2.0% |
| ? | 1614 | 1.7% |
| % | 1407 | 1.5% |
| ; | 712 | 0.7% |
| @ | 254 | 0.3% |
| # | 50 | 0.1% |
| , | 46 | < 0.1% |
| Other values (3) | 39 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 8092 | |
| 0 | 7889 | |
| 1 | 7375 | |
| 3 | 6154 | |
| 4 | 5713 | |
| 7 | 5622 | |
| 5 | 5578 | |
| 6 | 5386 | |
| 8 | 5319 | |
| 9 | 5190 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 3351 | |
| + | 120 | 3.4% |
| ~ | 78 | 2.2% |
| < | 3 | 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 45 | |
| [ | 10 | 15.6% |
| { | 9 | 14.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 45 | |
| ] | 10 | 15.9% |
| } | 8 | 12.7% |
Control
| Value | Count | Frequency (%) |
| ‚ | 33 | |
| ƒ | 31 | |
| ‘ | 1 | 1.5% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 17 | |
| ^ | 2 | 10.5% |
Space Separator
| Value | Count | Frequency (%) |
| Â | 2 | |
| 1 |
Other Letter
| Value | Count | Frequency (%) |
| æ‹ | 1 | |
| å‚… | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 11402 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 3688 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 22 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 522293 | 74.8% |
| Common | 176307 | 25.2% |
| Han | 2 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 49171 | 9.4% |
| e | 40829 | 7.8% |
| o | 35303 | 6.8% |
| a | 32792 | 6.3% |
| p | 30923 | 5.9% |
| s | 29208 | 5.6% |
| c | 28485 | 5.5% |
| i | 27617 | 5.3% |
| r | 23269 | 4.5% |
| n | 22460 | 4.3% |
| Other values (44) | 202236 |
Common
| Value | Count | Frequency (%) |
| / | 49030 | |
| . | 28354 | |
| : | 11749 | 6.7% |
| - | 11402 | 6.5% |
| 2 | 8092 | 4.6% |
| 0 | 7889 | 4.5% |
| 1 | 7375 | 4.2% |
| 3 | 6154 | 3.5% |
| 4 | 5713 | 3.2% |
| 7 | 5622 | 3.2% |
| Other values (34) | 34927 |
Han
| Value | Count | Frequency (%) |
| æ‹ | 1 | |
| å‚… | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 698466 | > 99.9% |
| None | 134 | < 0.1% |
| CJK | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 49171 | 7.0% |
| / | 49030 | 7.0% |
| e | 40829 | 5.8% |
| o | 35303 | 5.1% |
| a | 32792 | 4.7% |
| p | 30923 | 4.4% |
| s | 29208 | 4.2% |
| c | 28485 | 4.1% |
| . | 28354 | 4.1% |
| i | 27617 | 4.0% |
| Other values (81) | 346754 |
None
| Value | Count | Frequency (%) |
| ‚ | 33 | |
| Â | 33 | |
| Ã | 33 | |
| ƒ | 31 | |
| Â | 2 | 1.5% |
| µ | 1 | 0.7% |
| ‘ | 1 | 0.7% |
CJK
| Value | Count | Frequency (%) |
| æ‹ | 1 | |
| å‚… | 1 |
status_numerico
Categorical
UNIFORM 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 89.4 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 11430 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 5715 | 50.0% |
| 1 | 5715 | 50.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 5715 | |
| 1 | 5715 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 11430 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5715 | |
| 1 | 5715 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 11430 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 5715 | |
| 1 | 5715 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11430 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 5715 | |
| 1 | 5715 |
Subdomain_Count
Real number (ℝ)
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.0524934 |
| Minimum | 1 |
|---|---|
| Maximum | 14 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 89.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 4 |
| Maximum | 14 |
| Range | 13 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.86109888 |
|---|---|
| Coefficient of variation (CV) | 0.41953795 |
| Kurtosis | 26.648208 |
| Mean | 2.0524934 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.2559518 |
| Sum | 23460 |
| Variance | 0.74149129 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 7803 | 68.3% |
| 1 | 2058 | 18.0% |
| 3 | 785 | 6.9% |
| 4 | 640 | 5.6% |
| 5 | 96 | 0.8% |
| 6 | 15 | 0.1% |
| 7 | 10 | 0.1% |
| 12 | 9 | 0.1% |
| 8 | 7 | 0.1% |
| 9 | 3 | < 0.1% |
| Other values (3) | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 2058 | 18.0% |
| 2 | 7803 | 68.3% |
| 3 | 785 | 6.9% |
| 4 | 640 | 5.6% |
| 5 | 96 | 0.8% |
| 6 | 15 | 0.1% |
| 7 | 10 | 0.1% |
| 8 | 7 | 0.1% |
| 9 | 3 | < 0.1% |
| 10 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 14 | 1 | < 0.1% |
| 13 | 2 | < 0.1% |
| 12 | 9 | 0.1% |
| 10 | 1 | < 0.1% |
| 9 | 3 | < 0.1% |
| 8 | 7 | 0.1% |
| 7 | 10 | 0.1% |
| 6 | 15 | 0.1% |
| 5 | 96 | 0.8% |
| 4 | 640 | 5.6% |
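The descriptive statistics for Subdomain_Count can be cross-checked against the value counts. The sketch below rebuilds the full frequency table from the common-values and extreme-values tables above and recomputes the sum, mean, variance and coefficient of variation. The variable names are illustrative, and the use of the sample variance (dividing by n - 1) is an assumption, chosen because it reproduces the reported 0.74149129.

```python
import math

# Frequency table for Subdomain_Count, assembled from the tables above
# (value -> number of rows with that value).
counts = {1: 2058, 2: 7803, 3: 785, 4: 640, 5: 96, 6: 15, 7: 10,
          8: 7, 9: 3, 10: 1, 12: 9, 13: 2, 14: 1}

n = sum(counts.values())                        # number of observations
total = sum(v * c for v, c in counts.items())   # "Sum" in the report
mean = total / n

# Sample variance (divides by n - 1); an assumption that matches the report.
ss = sum(c * (v - mean) ** 2 for v, c in counts.items())
variance = ss / (n - 1)
std = math.sqrt(variance)
cv = std / mean                                 # coefficient of variation

print(n, total)
print(round(mean, 7), round(variance, 6), round(std, 6), round(cv, 6))
```

The same pattern applies to the other count-like columns whenever the common-values and extreme-values tables together list every distinct value.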
Domain_Length
Real number (ℝ)
| Distinct | 83 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 21.090289 |
| Minimum | 4 |
|---|---|
| Maximum | 214 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 89.4 KiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 11 |
| Q1 | 15 |
| median | 19 |
| Q3 | 24 |
| 95-th percentile | 42 |
| Maximum | 214 |
| Range | 210 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 10.777171 |
|---|---|
| Coefficient of variation (CV) | 0.51100159 |
| Kurtosis | 69.829931 |
| Mean | 21.090289 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 5.1600778 |
| Sum | 241062 |
| Variance | 116.14742 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 16 | 956 | 8.4% |
| 15 | 754 | 6.6% |
| 18 | 731 | 6.4% |
| 17 | 725 | 6.3% |
| 14 | 702 | 6.1% |
| 19 | 586 | 5.1% |
| 20 | 577 | 5.0% |
| 21 | 561 | 4.9% |
| 13 | 531 | 4.6% |
| 22 | 490 | 4.3% |
| Other values (73) | 4817 | 42.1% |
| Value | Count | Frequency (%) |
| 4 | 14 | 0.1% |
| 5 | 16 | 0.1% |
| 6 | 71 | 0.6% |
| 7 | 71 | 0.6% |
| 8 | 61 | 0.5% |
| 9 | 102 | 0.9% |
| 10 | 216 | 1.9% |
| 11 | 320 | 2.8% |
| 12 | 370 | 3.2% |
| 13 | 531 | 4.6% |
| Value | Count | Frequency (%) |
| 214 | 2 | < 0.1% |
| 213 | 3 | < 0.1% |
| 212 | 1 | < 0.1% |
| 211 | 1 | < 0.1% |
| 179 | 1 | < 0.1% |
| 150 | 1 | < 0.1% |
| 122 | 1 | < 0.1% |
| 120 | 1 | < 0.1% |
| 95 | 1 | < 0.1% |
| 87 | 1 | < 0.1% |
Has_File_Path
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 89.4 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 11430 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 10791 | 94.4% |
| 0 | 639 | 5.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 10791 | |
| 0 | 639 | 5.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 11430 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 10791 | |
| 0 | 639 | 5.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 11430 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 10791 | |
| 0 | 639 | 5.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11430 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 10791 | |
| 0 | 639 | 5.6% |
Parameter_Count
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 15 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.1565179 |
| Minimum | 1 |
|---|---|
| Maximum | 20 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 89.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 20 |
| Range | 19 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.81411714 |
|---|---|
| Coefficient of variation (CV) | 0.70393819 |
| Kurtosis | 144.16276 |
| Mean | 1.1565179 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 9.9373666 |
| Sum | 13219 |
| Variance | 0.66278672 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 10705 | 93.7% |
| 3 | 435 | 3.8% |
| 2 | 157 | 1.4% |
| 4 | 54 | 0.5% |
| 10 | 16 | 0.1% |
| 5 | 16 | 0.1% |
| 7 | 12 | 0.1% |
| 6 | 10 | 0.1% |
| 11 | 7 | 0.1% |
| 9 | 6 | 0.1% |
| Other values (5) | 12 | 0.1% |
| Value | Count | Frequency (%) |
| 1 | 10705 | 93.7% |
| 2 | 157 | 1.4% |
| 3 | 435 | 3.8% |
| 4 | 54 | 0.5% |
| 5 | 16 | 0.1% |
| 6 | 10 | 0.1% |
| 7 | 12 | 0.1% |
| 8 | 3 | < 0.1% |
| 9 | 6 | 0.1% |
| 10 | 16 | 0.1% |
| Value | Count | Frequency (%) |
| 20 | 2 | < 0.1% |
| 18 | 1 | < 0.1% |
| 17 | 2 | < 0.1% |
| 12 | 4 | < 0.1% |
| 11 | 7 | 0.1% |
| 10 | 16 | 0.1% |
| 9 | 6 | 0.1% |
| 8 | 3 | < 0.1% |
| 7 | 12 | 0.1% |
| 6 | 10 | 0.1% |
Slash_Count
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 22 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.2895888 |
| Minimum | 2 |
|---|---|
| Maximum | 33 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 89.4 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 4 |
| Q3 | 5 |
| 95-th percentile | 8 |
| Maximum | 33 |
| Range | 31 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.8822513 |
|---|---|
| Coefficient of variation (CV) | 0.43879528 |
| Kurtosis | 18.302403 |
| Mean | 4.2895888 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.7313119 |
| Sum | 49030 |
| Variance | 3.54287 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 4252 | 37.2% |
| 4 | 2675 | 23.4% |
| 5 | 1765 | 15.4% |
| 6 | 881 | 7.7% |
| 2 | 639 | 5.6% |
| 7 | 548 | 4.8% |
| 8 | 269 | 2.4% |
| 9 | 183 | 1.6% |
| 10 | 122 | 1.1% |
| 11 | 34 | 0.3% |
| Other values (12) | 62 | 0.5% |
| Value | Count | Frequency (%) |
| 2 | 639 | 5.6% |
| 3 | 4252 | 37.2% |
| 4 | 2675 | 23.4% |
| 5 | 1765 | 15.4% |
| 6 | 881 | 7.7% |
| 7 | 548 | 4.8% |
| 8 | 269 | 2.4% |
| 9 | 183 | 1.6% |
| 10 | 122 | 1.1% |
| 11 | 34 | 0.3% |
| Value | Count | Frequency (%) |
| 33 | 1 | < 0.1% |
| 29 | 1 | < 0.1% |
| 27 | 1 | < 0.1% |
| 23 | 3 | < 0.1% |
| 21 | 6 | 0.1% |
| 20 | 2 | < 0.1% |
| 18 | 2 | < 0.1% |
| 17 | 2 | < 0.1% |
| 16 | 2 | < 0.1% |
| 14 | 6 | 0.1% |
Dot_Positions
Unsupported
REJECTED  UNSUPPORTED 
| Missing | 0 |
|---|---|
| Missing (%) | 0.0% |
| Memory size | 89.4 KiB |
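The sample rows show Dot_Positions holding lists such as [10, 22, 33], which the profiler cannot summarize. One way to make the column analyzable is to reduce each list to a few scalars, as in the sketch below. It assumes cells are either Python lists or their string form (the report does not say which), and the function name and output keys are hypothetical.

```python
import ast

def dot_features(cell):
    """Turn one Dot_Positions cell into scalar features.

    Accepts a list of ints or its string form such as "[10, 22, 33]".
    The output keys are hypothetical names chosen for illustration.
    """
    positions = ast.literal_eval(cell) if isinstance(cell, str) else list(cell)
    return {
        "Dot_Count": len(positions),
        "First_Dot": positions[0] if positions else -1,   # -1 marks "no dot"
        "Last_Dot": positions[-1] if positions else -1,
    }

print(dot_features("[10, 22, 33]"))
print(dot_features([26]))
```

`ast.literal_eval` is used rather than `eval` because it only accepts Python literals, which is safer for values read from a file.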
Number_Count
Real number (ℝ)
ZEROS 
| Distinct | 130 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.4521435 |
| Minimum | 0 |
|---|---|
| Maximum | 679 |
| Zeros | 6566 |
| Zeros (%) | 57.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 89.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 5 |
| 95-th percentile | 24 |
| Maximum | 679 |
| Range | 679 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 16.319904 |
|---|---|
| Coefficient of variation (CV) | 2.9933005 |
| Kurtosis | 315.7282 |
| Mean | 5.4521435 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 12.097102 |
| Sum | 62318 |
| Variance | 266.33925 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 6566 | 57.4% |
| 1 | 660 | 5.8% |
| 2 | 483 | 4.2% |
| 4 | 397 | 3.5% |
| 3 | 379 | 3.3% |
| 6 | 376 | 3.3% |
| 5 | 284 | 2.5% |
| 8 | 224 | 2.0% |
| 7 | 187 | 1.6% |
| 10 | 160 | 1.4% |
| Other values (120) | 1714 | 15.0% |
| Value | Count | Frequency (%) |
| 0 | 6566 | 57.4% |
| 1 | 660 | 5.8% |
| 2 | 483 | 4.2% |
| 3 | 379 | 3.3% |
| 4 | 397 | 3.5% |
| 5 | 284 | 2.5% |
| 6 | 376 | 3.3% |
| 7 | 187 | 1.6% |
| 8 | 224 | 2.0% |
| 9 | 137 | 1.2% |
| Value | Count | Frequency (%) |
| 679 | 1 | < 0.1% |
| 269 | 2 | < 0.1% |
| 267 | 1 | < 0.1% |
| 256 | 1 | < 0.1% |
| 233 | 1 | < 0.1% |
| 222 | 1 | < 0.1% |
| 220 | 1 | < 0.1% |
| 212 | 1 | < 0.1% |
| 211 | 1 | < 0.1% |
| 201 | 1 | < 0.1% |
Hierarchy_Level
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 20 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.2487314 |
| Minimum | 1 |
|---|---|
| Maximum | 28 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 89.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 7 |
| Maximum | 28 |
| Range | 27 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.7740475 |
|---|---|
| Coefficient of variation (CV) | 0.54607391 |
| Kurtosis | 11.540582 |
| Mean | 3.2487314 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.1770892 |
| Sum | 37133 |
| Variance | 3.1472444 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 4306 | 37.7% |
| 3 | 2676 | 23.4% |
| 4 | 1764 | 15.4% |
| 5 | 878 | 7.7% |
| 1 | 639 | 5.6% |
| 6 | 539 | 4.7% |
| 7 | 258 | 2.3% |
| 8 | 188 | 1.6% |
| 9 | 106 | 0.9% |
| 10 | 32 | 0.3% |
| Other values (10) | 44 | 0.4% |
| Value | Count | Frequency (%) |
| 1 | 639 | 5.6% |
| 2 | 4306 | 37.7% |
| 3 | 2676 | 23.4% |
| 4 | 1764 | 15.4% |
| 5 | 878 | 7.7% |
| 6 | 539 | 4.7% |
| 7 | 258 | 2.3% |
| 8 | 188 | 1.6% |
| 9 | 106 | 0.9% |
| 10 | 32 | 0.3% |
| Value | Count | Frequency (%) |
| 28 | 1 | < 0.1% |
| 26 | 1 | < 0.1% |
| 22 | 2 | < 0.1% |
| 19 | 1 | < 0.1% |
| 17 | 1 | < 0.1% |
| 16 | 2 | < 0.1% |
| 15 | 2 | < 0.1% |
| 13 | 3 | < 0.1% |
| 12 | 14 | 0.1% |
| 11 | 17 | 0.1% |
Domain_Extension
Text
| Distinct | 288 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 89.4 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 3 |
| Mean length | 2.7834646 |
| Min length | 0 |
Characters and Unicode
| Total characters | 31815 |
|---|---|
| Distinct characters | 37 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 121 |
|---|---|
| Unique (%) | 1.1% |
Sample
| 1st row | com |
|---|---|
| 2nd row | com |
| 3rd row | com |
| 4th row | in |
| 5th row | com |
| Value | Count | Frequency (%) |
| com | 6997 | 61.2% |
| org | 645 | 5.6% |
| net | 377 | 3.3% |
| ru | 294 | 2.6% |
| uk | 193 | 1.7% |
| de | 134 | 1.2% |
| au | 129 | 1.1% |
| fr | 90 | 0.8% |
| br | 89 | 0.8% |
| io | 88 | 0.8% |
| Other values (277) | 2392 | 20.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 8035 | |
| c | 7280 | |
| m | 7131 | |
| r | 1387 | 4.4% |
| u | 900 | 2.8% |
| e | 881 | 2.8% |
| n | 788 | 2.5% |
| g | 786 | 2.5% |
| t | 674 | 2.1% |
| i | 571 | 1.8% |
| Other values (27) | 3382 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 31570 | |
| Decimal Number | 243 | 0.8% |
| Dash Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 8035 | |
| c | 7280 | |
| m | 7131 | |
| r | 1387 | 4.4% |
| u | 900 | 2.9% |
| e | 881 | 2.8% |
| n | 788 | 2.5% |
| g | 786 | 2.5% |
| t | 674 | 2.1% |
| i | 571 | 1.8% |
| Other values (16) | 3137 | 9.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 49 | |
| 2 | 38 | |
| 5 | 28 | |
| 3 | 23 | |
| 7 | 21 | |
| 8 | 21 | |
| 6 | 20 | |
| 4 | 19 | 7.8% |
| 9 | 12 | 4.9% |
| 0 | 12 | 4.9% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 31570 | |
| Common | 245 | 0.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 8035 | |
| c | 7280 | |
| m | 7131 | |
| r | 1387 | 4.4% |
| u | 900 | 2.9% |
| e | 881 | 2.8% |
| n | 788 | 2.5% |
| g | 786 | 2.5% |
| t | 674 | 2.1% |
| i | 571 | 1.8% |
| Other values (16) | 3137 | 9.9% |
Common
| Value | Count | Frequency (%) |
| 1 | 49 | |
| 2 | 38 | |
| 5 | 28 | |
| 3 | 23 | |
| 7 | 21 | |
| 8 | 21 | |
| 6 | 20 | |
| 4 | 19 | 7.8% |
| 9 | 12 | 4.9% |
| 0 | 12 | 4.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 31815 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 8035 | |
| c | 7280 | |
| m | 7131 | |
| r | 1387 | 4.4% |
| u | 900 | 2.8% |
| e | 881 | 2.8% |
| n | 788 | 2.5% |
| g | 786 | 2.5% |
| t | 674 | 2.1% |
| i | 571 | 1.8% |
| Other values (27) | 3382 |
Has_Uncommon_Chars
Boolean
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 11.3 KiB |
| Value | Count | Frequency (%) |
| False | 7510 | 65.7% |
| True | 3920 | 34.3% |
Has_Hyphens_Domain
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 11.3 KiB |
| Value | Count | Frequency (%) |
| False | 9686 | 84.7% |
| True | 1744 | 15.3% |
Vowel_Consonant_Ratio
Real number (ℝ)
| Distinct | 958 |
|---|---|
| Distinct (%) | 8.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.41591002 |
| Minimum | 0 |
|---|---|
| Maximum | 1.1176471 |
| Zeros | 24 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 89.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.2 |
| Q1 | 0.33333333 |
| median | 0.41666667 |
| Q3 | 0.5 |
| 95-th percentile | 0.61904762 |
| Maximum | 1.1176471 |
| Range | 1.1176471 |
| Interquartile range (IQR) | 0.16666667 |
Descriptive statistics
| Standard deviation | 0.1277548 |
|---|---|
| Coefficient of variation (CV) | 0.30716934 |
| Kurtosis | 0.20838194 |
| Mean | 0.41591002 |
| Median Absolute Deviation (MAD) | 0.083333333 |
| Skewness | -0.050938555 |
| Sum | 4753.8515 |
| Variance | 0.01632129 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.5 | 509 | 4.5% |
| 0.3333333333 | 451 | 3.9% |
| 0.4 | 278 | 2.4% |
| 0.4285714286 | 210 | 1.8% |
| 0.25 | 178 | 1.6% |
| 0.375 | 178 | 1.6% |
| 0.4444444444 | 142 | 1.2% |
| 0.2857142857 | 139 | 1.2% |
| 0.3636363636 | 131 | 1.1% |
| 0.3529411765 | 121 | 1.1% |
| Other values (948) | 9093 | 79.6% |
| Value | Count | Frequency (%) |
| 0 | 24 | 0.2% |
| 0.05555555556 | 1 | < 0.1% |
| 0.05882352941 | 2 | < 0.1% |
| 0.0625 | 5 | < 0.1% |
| 0.06666666667 | 14 | 0.1% |
| 0.06976744186 | 1 | < 0.1% |
| 0.07142857143 | 9 | 0.1% |
| 0.07692307692 | 13 | 0.1% |
| 0.08333333333 | 20 | 0.2% |
| 0.08695652174 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 1.117647059 | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
| 0.9545454545 | 1 | < 0.1% |
| 0.9285714286 | 1 | < 0.1% |
| 0.9 | 1 | < 0.1% |
| 0.875 | 1 | < 0.1% |
| 0.8644067797 | 1 | < 0.1% |
| 0.8333333333 | 2 | < 0.1% |
| 0.8275862069 | 1 | < 0.1% |
| 0.8222222222 | 1 | < 0.1% |
Special_Char_Count
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 30 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.5682415 |
| Minimum | 0 |
|---|---|
| Maximum | 29 |
| Zeros | 7510 |
| Zeros (%) | 65.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 89.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 9 |
| Maximum | 29 |
| Range | 29 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 3.7843047 |
|---|---|
| Coefficient of variation (CV) | 2.413088 |
| Kurtosis | 16.41615 |
| Mean | 1.5682415 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.7942926 |
| Sum | 17925 |
| Variance | 14.320962 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 7510 | 65.7% |
| 1 | 1279 | 11.2% |
| 2 | 620 | 5.4% |
| 3 | 600 | 5.2% |
| 4 | 248 | 2.2% |
| 5 | 236 | 2.1% |
| 6 | 157 | 1.4% |
| 7 | 119 | 1.0% |
| 8 | 82 | 0.7% |
| 9 | 71 | 0.6% |
| Other values (20) | 508 | 4.4% |
| Value | Count | Frequency (%) |
| 0 | 7510 | 65.7% |
| 1 | 1279 | 11.2% |
| 2 | 620 | 5.4% |
| 3 | 600 | 5.2% |
| 4 | 248 | 2.2% |
| 5 | 236 | 2.1% |
| 6 | 157 | 1.4% |
| 7 | 119 | 1.0% |
| 8 | 82 | 0.7% |
| 9 | 71 | 0.6% |
| Value | Count | Frequency (%) |
| 29 | 4 | < 0.1% |
| 28 | 4 | < 0.1% |
| 27 | 19 | 0.2% |
| 26 | 7 | 0.1% |
| 25 | 13 | 0.1% |
| 24 | 15 | 0.1% |
| 23 | 12 | 0.1% |
| 22 | 14 | 0.1% |
| 21 | 25 | 0.2% |
| 20 | 14 | 0.1% |
Entropy
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 89.4 KiB |
Length
| Max length | 18 |
|---|---|
| Median length | 18 |
| Mean length | 17.15748 |
| Min length | 3 |
Characters and Unicode
| Total characters | 196110 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.6666666666666666 |
|---|---|
| 2nd row | 0.6666666666666666 |
| 3rd row | 0.8333333333333334 |
| 4th row | 0.5 |
| 5th row | 0.6666666666666666 |
Common Values
| Value | Count | Frequency (%) |
| 0.6666666666666666 | 9190 | 80.4% |
| 0.8333333333333334 | 1598 | 14.0% |
| 0.5 | 637 | 5.6% |
| 1.0 | 5 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 147040 | |
| 3 | 22372 | 11.4% |
| 0 | 11430 | 5.8% |
| . | 11430 | 5.8% |
| 8 | 1598 | 0.8% |
| 4 | 1598 | 0.8% |
| 5 | 637 | 0.3% |
| 1 | 5 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 184680 | |
| Other Punctuation | 11430 | 5.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 147040 | |
| 3 | 22372 | 12.1% |
| 0 | 11430 | 6.2% |
| 8 | 1598 | 0.9% |
| 4 | 1598 | 0.9% |
| 5 | 637 | 0.3% |
| 1 | 5 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 11430 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 196110 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 6 | 147040 | |
| 3 | 22372 | 11.4% |
| 0 | 11430 | 5.8% |
| . | 11430 | 5.8% |
| 8 | 1598 | 0.8% |
| 4 | 1598 | 0.8% |
| 5 | 637 | 0.3% |
| 1 | 5 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 196110 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | 147040 | |
| 3 | 22372 | 11.4% |
| 0 | 11430 | 5.8% |
| . | 11430 | 5.8% |
| 8 | 1598 | 0.8% |
| 4 | 1598 | 0.8% |
| 5 | 637 | 0.3% |
| 1 | 5 | < 0.1% |
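The imbalance percentages in the alerts (68.9% for Has_File_Path, 55.7% for Entropy) can be reproduced from the value counts with a normalized-entropy score: one minus the Shannon entropy of the class frequencies divided by its maximum, log2 of the number of classes. The sketch below is a worked check under that assumption; the function name is illustrative, and the formula is inferred from the fact that it matches both reported figures rather than taken from this report.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits)
    of the class frequencies and k is the number of classes.
    0 means perfectly balanced; values near 1 mean one class dominates."""
    n = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # a single class has no balance to measure
    entropy = -sum((c / n) * math.log2(c / n) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

has_file_path = [10791, 639]         # counts for values 1 and 0
entropy_col = [9190, 1598, 637, 5]   # counts for the four Entropy values

print(round(100 * imbalance_score(has_file_path), 1))
print(round(100 * imbalance_score(entropy_col), 1))
```

By the same definition, status_numerico (5715 rows in each class) scores 0, which is consistent with its Uniform alert.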
Has_Url_Pattern
Categorical
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 89.4 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 11430 |
|---|---|
| Distinct characters | 1 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 11430 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 11430 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 11430 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 11430 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 11430 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 11430 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11430 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 11430 |
| | Domain_Length | Entropy | Has_File_Path | Has_Hyphens_Domain | Has_Uncommon_Chars | Hierarchy_Level | Number_Count | Parameter_Count | Slash_Count | Special_Char_Count | Subdomain_Count | Vowel_Consonant_Ratio | status_numerico |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Domain_Length | 1.000 | 0.172 | 0.064 | 0.444 | 0.150 | -0.093 | 0.121 | 0.154 | -0.097 | -0.033 | 0.483 | 0.239 | 0.276 |
| Entropy | 0.172 | 1.000 | 0.998 | 0.129 | 0.476 | 0.340 | 0.413 | 0.561 | 0.364 | 0.483 | 0.092 | 0.112 | 0.295 |
| Has_File_Path | 0.064 | 0.998 | 1.000 | 0.015 | 0.174 | 0.413 | 0.121 | 0.063 | 0.412 | 0.169 | 0.024 | 0.050 | 0.023 |
| Has_Hyphens_Domain | 0.444 | 0.129 | 0.015 | 1.000 | 0.017 | -0.034 | 0.125 | 0.156 | -0.036 | 0.018 | 0.177 | 0.099 | 0.211 |
| Has_Uncommon_Chars | 0.150 | 0.476 | 0.174 | 0.017 | 1.000 | 0.348 | 0.368 | 0.335 | 0.359 | 0.973 | 0.015 | 0.193 | 0.192 |
| Hierarchy_Level | -0.093 | 0.340 | 0.413 | -0.034 | 0.348 | 1.000 | 0.336 | 0.151 | 0.990 | 0.346 | -0.092 | 0.294 | 0.232 |
| Number_Count | 0.121 | 0.413 | 0.121 | 0.125 | 0.368 | 0.336 | 1.000 | 0.396 | 0.348 | 0.394 | 0.161 | 0.082 | 0.096 |
| Parameter_Count | 0.154 | 0.561 | 0.063 | 0.156 | 0.335 | 0.151 | 0.396 | 1.000 | 0.171 | 0.346 | 0.181 | 0.151 | 0.213 |
| Slash_Count | -0.097 | 0.364 | 0.412 | -0.036 | 0.359 | 0.990 | 0.348 | 0.171 | 1.000 | 0.359 | -0.091 | 0.292 | 0.230 |
| Special_Char_Count | -0.033 | 0.483 | 0.169 | 0.018 | 0.973 | 0.346 | 0.394 | 0.346 | 0.359 | 1.000 | 0.012 | 0.164 | 0.198 |
| Subdomain_Count | 0.483 | 0.092 | 0.024 | 0.177 | 0.015 | -0.092 | 0.161 | 0.181 | -0.091 | 0.012 | 1.000 | 0.020 | 0.268 |
| Vowel_Consonant_Ratio | 0.239 | 0.112 | 0.050 | 0.099 | 0.193 | 0.294 | 0.082 | 0.151 | 0.292 | 0.164 | 0.020 | 1.000 | 0.113 |
| status_numerico | 0.276 | 0.295 | 0.023 | 0.211 | 0.192 | 0.232 | 0.096 | 0.213 | 0.230 | 0.198 | 0.268 | 0.113 | 1.000 |
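The High correlation alerts can be traced back to this matrix. The sketch below copies the coefficients from the table and lists every pair whose absolute value reaches a cutoff. The cutoff of 0.5 is an assumption, chosen because it yields exactly the pairs named in the alerts; the function and variable names are illustrative.

```python
names = ["Domain_Length", "Entropy", "Has_File_Path", "Has_Hyphens_Domain",
         "Has_Uncommon_Chars", "Hierarchy_Level", "Number_Count",
         "Parameter_Count", "Slash_Count", "Special_Char_Count",
         "Subdomain_Count", "Vowel_Consonant_Ratio", "status_numerico"]

# Coefficients copied row by row from the correlation table above.
matrix = [
    [1.000, 0.172, 0.064, 0.444, 0.150, -0.093, 0.121, 0.154, -0.097, -0.033, 0.483, 0.239, 0.276],
    [0.172, 1.000, 0.998, 0.129, 0.476, 0.340, 0.413, 0.561, 0.364, 0.483, 0.092, 0.112, 0.295],
    [0.064, 0.998, 1.000, 0.015, 0.174, 0.413, 0.121, 0.063, 0.412, 0.169, 0.024, 0.050, 0.023],
    [0.444, 0.129, 0.015, 1.000, 0.017, -0.034, 0.125, 0.156, -0.036, 0.018, 0.177, 0.099, 0.211],
    [0.150, 0.476, 0.174, 0.017, 1.000, 0.348, 0.368, 0.335, 0.359, 0.973, 0.015, 0.193, 0.192],
    [-0.093, 0.340, 0.413, -0.034, 0.348, 1.000, 0.336, 0.151, 0.990, 0.346, -0.092, 0.294, 0.232],
    [0.121, 0.413, 0.121, 0.125, 0.368, 0.336, 1.000, 0.396, 0.348, 0.394, 0.161, 0.082, 0.096],
    [0.154, 0.561, 0.063, 0.156, 0.335, 0.151, 0.396, 1.000, 0.171, 0.346, 0.181, 0.151, 0.213],
    [-0.097, 0.364, 0.412, -0.036, 0.359, 0.990, 0.348, 0.171, 1.000, 0.359, -0.091, 0.292, 0.230],
    [-0.033, 0.483, 0.169, 0.018, 0.973, 0.346, 0.394, 0.346, 0.359, 1.000, 0.012, 0.164, 0.198],
    [0.483, 0.092, 0.024, 0.177, 0.015, -0.092, 0.161, 0.181, -0.091, 0.012, 1.000, 0.020, 0.268],
    [0.239, 0.112, 0.050, 0.099, 0.193, 0.294, 0.082, 0.151, 0.292, 0.164, 0.020, 1.000, 0.113],
    [0.276, 0.295, 0.023, 0.211, 0.192, 0.232, 0.096, 0.213, 0.230, 0.198, 0.268, 0.113, 1.000],
]

THRESHOLD = 0.5  # assumed cutoff, not stated in the report

def high_pairs(names, matrix, threshold):
    """Return (name_a, name_b, r) for each pair above the diagonal with |r| >= threshold."""
    pairs = []
    for i in range(len(names)):
        for j in range(i + 1, len(names)):
            if abs(matrix[i][j]) >= threshold:
                pairs.append((names[i], names[j], matrix[i][j]))
    # Strongest relationships first.
    return sorted(pairs, key=lambda p: -abs(p[2]))

pairs = high_pairs(names, matrix, THRESHOLD)
for a, b, r in pairs:
    print(f"{a} ~ {b}: {r:.3f}")
```

Pairs near 0.99, such as Hierarchy_Level and Slash_Count, carry almost the same information, so keeping only one of each pair is a common simplification before modeling.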
| | url | status_numerico | Subdomain_Count | Domain_Length | Has_File_Path | Parameter_Count | Slash_Count | Dot_Positions | Number_Count | Hierarchy_Level | Domain_Extension | Has_Uncommon_Chars | Has_Hyphens_Domain | Vowel_Consonant_Ratio | Special_Char_Count | Entropy | Has_Url_Pattern |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | http://www.crestonwood.com/router.php | 0 | 2 | 19 | 1 | 1 | 3 | [10, 22, 33] | 0 | 2 | com | False | False | 0.363636 | 0 | 0.666667 | 1 |
| 1 | http://shadetreetechnology.com/V4/validation/a111aedc8ae390eabcfa130e041a10a4 | 1 | 1 | 23 | 1 | 1 | 5 | [26] | 17 | 4 | com | True | False | 0.827586 | 1 | 0.666667 | 1 |
| 2 | https://support-appleld.com.secureupdate.duilawyeryork.com/ap/89e6a3b4b063b8d/?cmd=_update&dispatch=89e6a3b4b063b8d1b&locale=_ | 1 | 4 | 50 | 1 | 3 | 5 | [23, 27, 40, 54] | 19 | 4 | com | True | True | 0.517241 | 1 | 0.833333 | 1 |
| 3 | http://rgipt.ac.in | 0 | 2 | 11 | 0 | 1 | 2 | [12, 15] | 0 | 1 | in | False | False | 0.300000 | 0 | 0.500000 | 1 |
| 4 | http://www.iracing.com/tracks/gateway-motorsports-park/ | 0 | 2 | 15 | 1 | 1 | 5 | [10, 18] | 0 | 4 | com | False | False | 0.363636 | 0 | 0.666667 | 1 |
| 5 | http://appleid.apple.com-app.es/ | 1 | 3 | 24 | 1 | 1 | 3 | [14, 20, 28] | 0 | 2 | es | False | True | 0.500000 | 0 | 0.666667 | 1 |
| 6 | http://www.mutuo.it | 0 | 2 | 12 | 0 | 1 | 2 | [10, 16] | 0 | 1 | it | False | False | 0.400000 | 0 | 0.500000 | 1 |
| 7 | http://www.shadetreetechnology.com/V4/validation/ba4b8bddd7958ecb8772c836c2969531 | 1 | 2 | 27 | 1 | 1 | 5 | [10, 30] | 21 | 4 | com | True | False | 0.405405 | 1 | 0.666667 | 1 |
| 8 | http://vamoaestudiarmedicina.blogspot.com/ | 0 | 2 | 34 | 1 | 1 | 3 | [28, 37] | 0 | 2 | com | False | False | 0.636364 | 0 | 0.666667 | 1 |
| 9 | https://parade.com/425836/joshwigler/the-amazing-race-host-phil-keoghan-previews-the-season-27-premiere/ | 0 | 1 | 10 | 1 | 1 | 6 | [14] | 8 | 5 | com | False | False | 0.591837 | 0 | 0.666667 | 1 |
Last rows
| | url | status_numerico | Subdomain_Count | Domain_Length | Has_File_Path | Parameter_Count | Slash_Count | Dot_Positions | Number_Count | Hierarchy_Level | Domain_Extension | Has_Uncommon_Chars | Has_Hyphens_Domain | Vowel_Consonant_Ratio | Special_Char_Count | Entropy | Has_Url_Pattern |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 11420 | https://adnanboz.wordpress.com/2012/01/06/how-to-set-up-amazon-ec2-windows-gpu-instance-for-nvidia-cuda-development/ | 0 | 2 | 22 | 1 | 1 | 7 | [16, 26] | 9 | 6 | com | False | False | 0.545455 | 0 | 0.666667 | 1 |
| 11421 | http://www.peoplemakingplaces.com/includes/Support/En/log/signin/customer_center/customer-IDPP00C644/myaccount/signin | 1 | 2 | 26 | 1 | 1 | 11 | [10, 29] | 5 | 10 | com | True | False | 0.476923 | 7 | 0.666667 | 1 |
| 11422 | http://sheetdownload.com/ | 0 | 1 | 17 | 1 | 1 | 3 | [20] | 0 | 2 | com | False | False | 0.428571 | 0 | 0.666667 | 1 |
| 11423 | http://www.dmega.co.kr/dmega/data/qna/sec/page.php?email=ZmFpdGhAc2VtYW50aWMuaW5mbw== | 1 | 3 | 15 | 1 | 1 | 7 | [10, 16, 19, 46] | 4 | 6 | kr | True | False | 0.382979 | 8 | 0.833333 | 1 |
| 11424 | http://www.answers.com/Q/What_are_the_sizes_of_computer_memory | 0 | 2 | 15 | 1 | 1 | 4 | [10, 18] | 0 | 3 | com | True | False | 0.441176 | 3 | 0.666667 | 1 |
| 11425 | http://www.fontspace.com/category/blackletter | 0 | 2 | 17 | 1 | 1 | 4 | [10, 20] | 0 | 3 | com | False | False | 0.357143 | 0 | 0.666667 | 1 |
| 11426 | http://www.budgetbots.com/server.php/Server%20update/index.php?email=USER@DOMAIN.com | 1 | 2 | 18 | 1 | 1 | 5 | [10, 21, 32, 58, 80] | 2 | 4 | com | True | False | 0.488889 | 12 | 0.833333 | 1 |
| 11427 | https://www.facebook.com/Interactive-Television-Pvt-Ltd-Group-M-100230523435650/photos/?ref=page_internal | 0 | 2 | 16 | 1 | 1 | 5 | [11, 20] | 15 | 4 | com | True | False | 0.520833 | 7 | 0.833333 | 1 |
| 11428 | http://www.mypublicdomainpictures.com/ | 0 | 2 | 30 | 1 | 1 | 3 | [10, 33] | 0 | 2 | com | False | False | 0.391304 | 0 | 0.666667 | 1 |
| 11429 | http://174.139.46.123/ap/signin?openid.pape.max_auth_age=0&openid.return_to=https%3A%2F%2Fwww.amazon.co.jp%2F%3Fref_%3Dnav_em_hd_re_signin&openid.identity=http%3A%2F%2Fspecs.openid.net%2Fauth%2F2.0%2Fidentifier_select&openid.assoc_handle=jpflex&openid.mode=checkid_setup&key=a@b.c&openid.claimed_id=http%3A%2F%2Fspecs.openid.net%2Fauth%2F2.0%2Fidentifier_select&openid.ns=http%3A%2F%2Fspecs.openid.net%2Fauth%2F2.0&&ref_=nav_em_hd_clc_signin | 1 | 3 | 14 | 1 | 10 | 4 | [10, 14, 17, 38, 43, 69, 97, 104, 107, 153, 181, 188, 203, 236, 267, 298, 311, 341, 348, 363, 396, 418, 425, 440] | 41 | 3 | 123 | True | False | 0.531818 | 7 | 0.833333 | 1 |
Most frequently occurring
| | url | status_numerico | Subdomain_Count | Domain_Length | Has_File_Path | Parameter_Count | Slash_Count | Number_Count | Hierarchy_Level | Domain_Extension | Has_Uncommon_Chars | Has_Hyphens_Domain | Vowel_Consonant_Ratio | Special_Char_Count | Entropy | Has_Url_Pattern | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | http://e710z0ear.du.r.appspot.com/c:/users/user/downlo | 1 | 4 | 26 | 1 | 1 | 6 | 4 | 5 | com | False | False | 0.52 | 0 | 0.666667 | 1 | 2 |